SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Davis, Roger J. 

Raingeaud, Joel 
Gupta, Shashi 
Derijard, Benoit 

(ii) TITLE OF INVENTION: CYTOKINE-, STRESS-, AND ONCOPROTEIN-ACTIVATED HUMAN PROTEIN KINASE KINASES

(iii) NUMBER OF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C.

(B) STREET: 225 Franklin Street

(C) CITY: Boston

(D) STATE: Massachusetts

(E) COUNTRY: USA

(F) ZIP: 02110

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: WordPerfect 5.1

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/446,083

(B) FILING DATE: 05/19/95

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Fasse, J. Peter 

(B) REGISTRATION NUMBER: 32,983 

(C) REFERENCE/DOCKET NUMBER: 04020/09001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
TGGCTGGCAA TGGCCTTGCT GACCTCGAGC CGGGCCCACG TGGGGACCTT TGGAGCACAG 60
CCTACGATCC TGGTGCAAGG CCGGTGGATG CAGAGGCCAG TCCATATACC ACCCAGGCCT 120
GCGAGGAGCG TGGTCCCCAC CCATCCAGCC CATATGTGCA AGTGCCCTTG ACAGAGAGGC 180
TGGTCATATC CATGGTGACC ATTTATGGGC CACAACAGGT CCCCATCTGC GCAGTGAACC 240
CTGTGCTGAG CACCTTGCAG ACGTGATCTT GCTTCGTCCT GCAGCACTGT GCGGGGCAGG 300
AAAATCCAAG AGGAAGAAGG ATCTACGGAT ATCCTGCATG TCCAAGCCAC CCGCACCCAA 360
CCCCACACCC CCCCGGAACC TGGACTCCCG GACCTTCATC ACCATTGGAG ACAGAAACTT 420
TGAGGTGGAG GCTGATGACT TGGTGACCAT CTCAGAACTG GGCCGTGGAG CCTATGGGGT 480
GGTAGAGAAG GTGCGGCACG CCCAGAGCGG CACCATCATG GCCGTGAAGC GGATCCGGGC 540
CACCGTGAAC TCACAGGAGC AGAAGCGGCT GCTCATGGAC CTGGACATCA ACATGCGCAC 600
GGTCGACTGT TTCTACACTG TCACCTTCTA CGGGGCACTA TTCAGAGAGG GAGACGTGTG 660
GATCTGCATG GAGCTCATGG ACACATCCTT GGACAAGTTC TACCGGAAGG TGCTGGATAA 720
AAACATGACA ATTCCAGAGG ACATCCTTGG GGAGATTGCT GTGTCTATCG TGCGGGCCCT 780
GGAGCATCTG CACAGCAAGC TGTCGGTGAT CCACAGAGAT GTGAAGCCCT CCAATGTCCT 840
TATCAACAAG GAGGGCCATG TGAAGATGTG TGACTTTGGC ATCAGTGGCT ACTTGGTGGA 900
CTCTGTGGCC AAGACGATGG ATGCCGGCTG CAAGCCCTAC ATGGCCCCTG AGAGGATCAA 960
CCCAGAGCTG AACCAGAAGG GCTACAATGT CAAGTCCGAC GTCTGGAGCC TGGGCATCAC 1020
CATGATTGAG ATGGCCATCC TGCGGTTCCC TTACGAGTCC TGGGGGACCC CGTTCCAGCA 1080
GCTGAAGCAG GTGGTGGAGG AGCCGTCCCC CCAGCTCCCA GCCGACCGTT TCTCCCCCGA 1140
GTTTGTGGAC TTCACTGCTC AGTGCCTGAG GAAGAACCCC GCAGAGCGTA TGAGCTACCT 1200
GGAGCTGATG GAGCACCCCT TCTTCACCTT GCACAAAACC AAGAAGACGG ACATTGCTGC 1260
CTTCGTGAAG AAGATCCTGG GAGAAGACTC ATAGGGGCTG GGCCTCGGAC CCCACTCCGG 1320
CCCTCCAGAG CCCCACAGCC CCATCTGCGG GGGCAGTGCT CACCCACACC ATAAGCTACT 1380
GCCATCCTGG CCCAGGGCAT CTGGGAGGAA CCGAGGGGGC TGCTCCCACC TGGCTCTGTG 1440
GCGAGCCATT TGTCCCAAGT GCCAAAGAAG CAGACCATTG GGGCTCCCAG CCAGGCCCTT 1500
GTCGGCCCCA CCAGTGCCTC TCCCTGCTGC TCCTAGGACC CGTCTCCAGC TGCTGAGATC 1560
CTGGACTGAG GGGGCCTGGA TGCCCCCTGT GGATGCTGCT GCCCCTGCAC AGCAGGCTGC 1620
CAGTGCCTGG GTGGATGGGC CACCGCCTTG CCCAGCCTGG ATGCCATCCA AGTTGTATAT 1680
TTTTTTAATC TCTCGACTGA ATGGACTTTG CACACTTTGG CCCAGGGTGG CCACACCTCT 1740
ATCCCGGCTT TGGTGCGGGG TACACAAGAG GGGATGAGTT GTGTGAATAC CCCAAGACTC 1800
CCATGAGGGA GATGCCATGA GCCGCCCAAG GCCTTCCCCT GGCACTGGCA AACAGGGCCT 1860
CTGCGGAGCA CACTGGCTCA CCCAGTCCTG CCCGCCACCG TTATCGGTGT CATTCACCTT 1920
TCGTGTTTTT TTTAATTTAT CCTCTGTTGA TTTTTTCTTT TGCTTTATGG GTTTGGCTTG 1980
TTTTTCTTGC ATGGTTTGGA GCTGATCGCT TCTCCCCCAC CCCCTAGGGG 2030



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: peptide 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

5 10 15

Met Ser Lys Pro Pro Ala Pro Asn Pro Thr Pro Pro Arg Asn Leu Asp

20 25 30

Ser Arg Thr Phe Ile Thr Ile Gly Asp Arg Asn Phe Glu Val Glu Ala

35 40 45

Asp Asp Leu Val Thr Ile Ser Glu Leu Gly Arg Gly Ala Tyr Gly Val

50 55 60

Val Glu Lys Val Arg His Ala Gln Ser Gly Thr Ile Met Ala Val Lys

65 70 75 80

Arg Ile Arg Ala Thr Val Asn Ser Gln Glu Gln Lys Arg Leu Leu Met

85 90 95

Asp Leu Asp Ile Asn Met Arg Thr Val Asp Cys Phe Tyr Thr Val Thr

100 105 110

Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Val Trp Ile Cys Met Glu

115 120 125

Leu Met Asp Thr Ser Leu Asp Lys Phe Tyr Arg Lys Val Leu Asp Lys

130 135 140

Asn Met Thr Ile Pro Glu Asp Ile Leu Gly Glu Ile Ala Val Ser Ile

145 150 155 160

Val Arg Ala Leu Glu His Leu His Ser Lys Leu Ser Val Ile His Arg

165 170 175

Asp Val Lys Pro Ser Asn Val Leu Ile Asn Lys Glu Gly His Val Lys

200 205 210

Met Cys Asp Phe Gly Ile Ser Gly Tyr Leu Val Asp Ser Val Ala Lys

215 220 225

Thr Met Asp Ala Gly Cys Lys Pro Tyr Met Ala Pro Glu Arg Ile Asn

230 235 240

Pro Glu Leu Asn Gln Lys Gly Tyr Asn Val Lys Ser Asp Val Trp Ser

245 250 255 260

Leu Gly Ile Thr Met Ile Glu Met Ala Ile Leu Arg Phe Pro Tyr Glu

265 270 275

Ser Trp Gly Thr Pro Phe Gln Gln Leu Lys Gln Val Val Glu Glu Pro

280 285 290

Ser Pro Gln Leu Pro Ala Asp Arg Phe Ser Pro Glu Phe Val Asp Phe

295 300 305

Thr Ala Gln Cys Leu Arg Lys Asn Pro Ala Glu Arg Met Ser Tyr Leu

310 315 320

Glu Leu Met Glu His Pro Phe Phe Thr Leu His Lys Thr Lys Lys Thr

325 330 335 338

Asp Ile Ala Ala Phe Val Lys Lys Ile Leu Gly Glu Asp Ser

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
TAGCTGCAGC ACAGCCTTCC CTAACGTTGC AACTGGGGGA AAAATCACTT TCCAGTCTGT 60 
TTTGCAAGGT GTGCATTTCC ATCTTGATTC CCTGAAAGTC CATCTGCTGC ATCGGTCAAG 120 
IgaSotSX CTTGCATGAA GATTGCACGC CTGCAGCTTG CATCTTTGTT GCAAAACTAG 180 
CTACAGAAGA GAAGCAAGGC AAAGTCTTTT GTGCTCCCCT CCCCCATCAA AGGAAAGGGG 240 
SSgTCTC AGTCGAAAGG CAAGAAGCGA AACCCTGGCC TTAAAATTCC AAAAGAAGCA 300 
T^aIcAAC CTCAGACCAG TTCCACACCA CCTAGAGATT TAGACTCCAA GGCTTGCATT 360 
TCTATTGGAA ATCAGAACTT TGAGGTGAAG GCAGATGACC TGGAGCCTAT AATGGAACTG 420
GGACGAGGTG CGTACGGGGT GGTGGAGAAG ATGCGGCACG TGCCCAGCGG GCAGATCATG 480
GCAGTGAAGC GGATCCGAGC CACAGTAAAT AGCCAGGAAC AGAAACGGCT ACTGATGGAT 540 
TTGGATATTT CCATGAGGAC GGTGGACTGT CCATTCACTG TCACCTTTTA TGGCGCACTG 600 
TTTCGGGAGG GTGATGTGTG GATCTGCATG GAGCTCATGG ATACATCACT AGATAAATTC 660 
TACAAACAAG TTATTGATAA AGGCCAGACA ATTCCAGAGG ACATCTTAGG GAAAATAGCA 720 
GTTTCTATTG TAAAAGCATT AGAACATTTA CATAGTAAGC TGTCTGTCAT TCACAGAGAC 780 
GTCAAGCCTT CTAATGTACT CATCAATGCT CTCGGTCAAG TGAAGATGTG CGATTTTGGA 840 
ATCAGTGGCT ACTTGGTGGA CTCTGTTGCT AAAACAATTG ATGCAGGTTG CAAACCATAC 900 
ATGGCCCCTG AAAGAATAAA CCCAGAGCTC AACCAGAAGG GATACAGTGT GAAGTCTGAC 960 
ATTTGGAGTC TGGGCATCAC GATGATTGAG TTGGCCATCC TTCGATTTCC CTATGATTCA 1020 
TGGGGAACTC CATTTCAGCA GCTCAAACAG GTGGTAGAGG AGCCATCGCC ACAACTCCCA 1080 
GCAGACAAGT TCTCTGCAGA GTTTGTTGAC TTTACCTCAC AGTGCTTAAA GAAGAATTCC 1140 
AAAGAACGGC CTACATACCC AGAGCTAATG CAACATCCAT TTTTCACCCT ACATGAATCC 1200 
AAAGGAACAG ATGTGGCATC TTTTGTAAAA CTGATTCTTG GAGACTAAAA AGCAGTGGAC 1260 
TTAATCGGTT GACCCTACTG TGGATTGGTG GGTTTCGGGG TGAAGCAAGT TCACTACAGC 1320
ATCAATAGAA AGTCATCTTT GAGATAATTT AACCCTGCCT CTCAGAGGGT TTTCTCTCCC 1380
AATTTTCTTT TTACTCCCCC TCTTAAGGGG GCCTTGGAAT CTATAGTATA GAATGAACTG 1440 
J^CTAGATGGA TGAATTATGA TAAAGGCTTA GGACTTCAAA AGGTGATTAA ATATTTAATG 1500
KTGTGTCATA TGAGTCCTCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1560 
jSgAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 1602 

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 344 amino acids

(B) TYPE: peptide

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:











5 10 15

Met Ser Gln Ser Lys Gly Lys Lys Arg Asn Pro Gly Leu Lys Ile Pro

20 25 30

Lys Glu Ala Phe Glu Gln Pro Gln Thr Ser Ser Thr Pro Pro Arg Asp

35 40 45

Leu Asp Ser Lys Ala Cys Ile Ser Ile Gly Asn Gln Asn Phe Glu Val

50 55 60

Lys Ala Asp Asp Leu Glu Pro Ile Met Glu Leu Gly Arg Gly Ala Tyr

65 70 75 80

Gly Val Val Glu Lys Met Arg His Val Pro Ser Gly Gln Ile Met Ala

85 90 95

Val Lys Arg Ile Arg Ala Thr Val Asn Ser Gln Glu Gln Lys Arg Leu

100 105 110

Leu Met Asp Leu Asp Ile Ser Met Arg Thr Val Asp Cys Pro Phe Thr

115 120 125

Val Thr Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Val Trp Ile Cys

130 135 140

Met Glu Leu Met Asp Thr Ser Leu Asp Lys Phe Tyr Lys Gln Val Ile

145 150 155 160

Asp Lys Gly Gln Thr Ile Pro Glu Asp Ile Leu Gly Lys Ile Ala Val

165 170 175

Ser Ile Val Lys Ala Leu Glu His Leu His Ser Lys Leu Ser Val Ile

180 185 190

His Arg Asp Val Lys Pro Ser Asn Val Leu Ile Asn Ala Leu Gly Gln

195 200 205

Val Lys Met Cys Asp Phe Gly Ile Ser Gly Tyr Leu Val Asp Ser Val

210 215 220

Ala Lys Thr Ile Asp Ala Gly Cys Lys Pro Tyr Met Ala Pro Glu Arg

225 230 235 240

Ile Asn Pro Glu Leu Asn Gln Lys Gly Tyr Ser Val Lys Ser Asp Ile

245 250 255

Trp Ser Leu Gly Ile Thr Met Ile Glu Leu Ala Ile Leu Arg Phe Pro

260 265 270

Tyr Asp Ser Trp Gly Thr Pro Phe Gln Gln Leu Lys Gln Val Val Glu

275 280 285

Glu Pro Ser Pro Gln Leu Pro Ala Asp Lys Phe Ser Ala Glu Phe Val

290 295 300

Asp Phe Thr Ser Gln Cys Leu Lys Lys Asn Ser Lys Glu Arg Pro Thr

305 310 315 320

Tyr Pro Glu Leu Met Gln His Pro Phe Phe Thr Leu His Glu Ser Lys

325 330 334

Gly Thr Asp Val Ala Ser Phe Val Lys Leu Ile Leu Gly Asp






(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3498 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

CTAGGGTCCC CGGCGCCAGG CCACCCGGCC GTCAGCAGCA TGCAGGGTAA ACGCAAAGCA 60

CTGAAGTTGA ATTTTGCAAA TCCACCTTTC AAATCTACAG CAAGGTTTAC TCTGAATCCC 120 

AATCCTACAG GAGTTCAAAA CCCACACATA GAGAGACTGA GAACACACAG CATTGAGTCA 180 

TCAGGAAAAC TGAAGATCTC CCCTGAACAA CACTGGGATT TCACTGCAGA GGACTTGAAA 240 

GACCTTGGAG AAATTGGACG AGGAGCTTAT GGTTCTGTCA ACAAAATGGT CCACAAACCA 300 

AGTGGGCAAA TAATGGCAGT TAAAAGAATT CGGTCAACAG TGGATGAAAA AGAACAAAAA 360 

CAACTTCTTA TGGATTTGGA TGTAGTAATG CGGAGTAGTG ATTGCCCATA CATTGTTCAG 420 

TTTTATGGTG CACTCTTCAG AGAGGGTGAC TGTTGGATCT GTATGGAACT CATGTCTACC 480 

TCGTTTGATA AGTTTTACAA ATATGTATAT AGTGTATTAG ATGATGTTAT TCCAGAAGAA 540 

ATTTTAGGCA AAATCACTTT AGCAACTGTG AAAGCACTAA ACCACTTAAA AGAAAACTTG 600 

AAAATTATTC ACAGAGATAT CAAACCTTCC AATATTCTTC TGGACAGAAG TGGAAATATT 660 

AAGCTCTGTG ACTTCGGCAT CAGTGGACAG CTTGTGGACT CTATTGCCAA GACAAGAGAT 720 

GCTGGCTGTA GGCCATACAT GGCACCTGAA AGAATAGACC CAAGCGCATC ACGACAAGGA 780 

TATGATGTCC GCTCTGATGT CTGGAGTTTG GGGATCACAT TGTATGAGTT GGCCACAGGC 840 

CGATTTCCTT ATCCAAAGTG GAATAGTGTA TTTGATCAAC TAACACAAGT CGTGAAAGGA 900 

GATCCTCCGC AGCTGAGTAA TTCTGAGGAA AGGGAATTCT CCCCGAGTTT CATCAACTTT 960 

GTCAACTTGT GCCTTACGAA GGATGAATCC AAAAGGCCAA AGTATAAAGA GCTTCTGAAA 1020 

CATCCCTTTA TTTTGATGTA TGAAGAACGT GCCGTTGAGG TCGCATGCTA TGTTTGTAAA 1080
ATCCTGGATC AAATGCCAGC TACTCCCAGC TCTCCCATGT ATGTCGATTG ATATCGYTGC 1140
TACATCAGAC TCTAGAAAAA AGGGCTGAGA GGAAGCAAGA CGTAAAGAAT TTTCATCCCG 1200 
TATCACAGTG TTTTTATTGC TCGCCCAGAC ACCATGTGCA ATAAGATTGG TGTTCGTTTC 1260 
CATCATGTCT GTATACTCCT GTCACCTAGA ACGTGCATCC TTGTAATACC TGATTGATCA 1320 
CACAGTGTTA GTGCTGGTCA GAGAGACCTC ATCCTGCTCT TTTGTGATGA ACATATTCAT 1380 
GAAATGTGGA AGTCAGTACG ATCAAGTTGT TGACTGTGAT TAGATCACAT CTTAAATTCA 1440 
TTTCTAGACT CAAAACCTGG AGATGCAGCT ACTGGAATGG TGTTTTGTCA GACTTGCAAA 1500 
TCCTGGAAGG ACACAGTGAT GAATGTACTA TATCTGAACA TAGAAACTCG GGCTTGAGTG 1560 
AGAAGAGCTT GCACAGCCAA CGAGACACAT TGCCTTCTGG AGCTGGGAGA CAAAGGAGGA 1620 
ATTTACTTTC TTCACCAAGT GCAATAGATT ACTGATGTGA TATTCTGTTG CTTTACAGTT 1680 
ACAGTTGATG TTTGGGGATC GATGTGCTCA GCCAAATTTC CTGTTTGAAA TATCATGTTA 1740 
AATTAGAATG AATTTATCTT TACCAAAAAC CATGTTGCGT TCAAAGAGGT GAACATTAAA 1800 
ATATAGAGAC AGGACAGAAT GTGTTCTTTT CTCCTCTACC AGTCCTATTT TTCAATGGGA 1860 
AGACTCAGGA GTCTGCCACT TGTCAAAGAA GGTGCTGATC CTAAGAATTT TTCATTCTCA 1920 
GAATTCGGTG TGCTGCCAAC TTGATGTTCC ACCTGCCACA AACCACCAGG ACTGAAAGAA 1980 
GAAAACAGTA CAGAAGGCAA AGTTTACAGA TGTTTTTAAT TCTAGTATTT TATCTGGAAC 2040 
AACTTGTAGC AGCTATATAT TTCCCCTTGG TCCCAAGCCT GATACTTTAG CCATCATAAC 2100 
TCACTAACAG GGAGAAGTAG CTAGTAGCAA TGTGCCTTGA TTGATTAGAT AAAGATTTCT 2160
AGTAGGCAGC AAAAGACCAA ATCTCAGTTG TTTGCTTCTT GCCATCACTG GTCCAGGTCT 2220
TCAGTTTCCG AATCTCTTTC CCTTCCCCTG TGGTCTATTG TCGCTATGTG ACTTGCGCTT 2280
AATCCAATAT TTTGCCTTTT TTCTATATCA AAAAACCTTT ACAGTTAGCA GGGATGTTCC 2340
TTACCGAGGA TTTTTAACCC CCAATCTCTC ATAATCGCTA GTGTTTAAAA GGCTAAGAAT 2400
AGTGGGGCCC AACCGATGTG GTAGGTGATA AAGAGGCATC TTTTCTAGAG ACACATTGGA 2460
CCAGATGAGG ATCCGAAACG GCAGCCTTTA CGTTCATCAC CTGCTAGAAC CTCTCGTAGT 2520
CCATCACCAT TTCTTGGCAT TGGAATTCTA CTGGAAAAAA ATACAAAAAG CAAAACAAAA 2580
CCCTCAGCAC TGTTACAAGA GGCCATTTAA GTATCTTGTG CTTCTTCACT TACCCATTAG 2640
CCAGGTTCTC ATTAGGTTTT GCTTGGGCCT CCCTGGCACT GAACCTTAGG CTTTGTATGA 2700
CAGTGAAGCA GCACTGTGAG TGGTTCAAGC ACACTGGAAT ATAAAACAGT CATGGCCTGA 2760
GATGCAGGTG ATGCCATTAC AGAACCAAAT CGTGGCACGT ATTGCTGTGT CTCCTCTCAG 2820
AGTGACAGTC ATAAATACTG TCAAACAATA AAGGGAGAAT GGTGCTGTTT AAAGTCACAT 2880
CCCTGTAAAT TGCAGAATTC AAAAGTGATT ATCTCTTTGA TCTACTTGCC TCATTTCCCT 2940
ATCTTCTCCC CCACGGTATC CTAAACTTTA GACTTCCCAC TGTTCTGAAA GGAGACATTG 3000
CTCTATGTCT GCCTTCGACC ACAGCAAGCC ATCATCCTCC ATTGCTCCCG GGGACTCAAG 3060
AGGAATCTGT TTCTCTGCTG TCAACTTCCC ATCTGGCTCA GCATAGGGTC ACTTTGCCAT 3120 
TATGCAAATG GAGATAAAAG CAATTCTGGC TGTCCAGGAG CTAATCTGAC CGTTCTATTG 3180 
TGTGGATGAC CACATAAGAA GGCAATTTTA GTGTATTAAT CATAGATTAT TATAAACTAT 3240
AAACTTAAGG GCAAGGAGTT TATTACAATG TATCTTTATT AAAACAAAAG GGTGTATAGT 3300 
GTTCACAAAC TGTGAAAATA GTGTAAGAAC TGTACATTGT GAGCTCTGGT TATTTTTCTC 3360 
TTGTACCATA GAAAAATGTA TAAAAATTAT CAAAAAGCTA ATGTGCAGGG ATATTGCCTT 3420 
ATTTGTCTGT AAAAAATGGA GCTCAGTAAC ATAACTGCTT CTTGGAGCTT TGGAATATTT 3480 
TATCCTGTAT TCTTGTTT 3498 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids

(B) TYPE: peptide
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

5 10 15

Met Gln Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe Ala Asn Pro Pro

20 25 30

Phe Lys Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val

35 40 45

Gln Asn Pro His Ile Glu Arg Leu Arg Thr His Ser Ile Glu Ser Ser

50 55 60

Gly Lys Leu Lys Ile Ser Pro Glu Gln His Trp Asp Phe Thr Ala Glu

65 70 75 80

Asp Leu Lys Asp Leu Gly Glu Ile Gly Arg Gly Ala Tyr Gly Ser Val

85 90 95

Asn Lys Met Val His Lys Pro Ser Gly Gln Ile Met Ala Val Lys Arg

100 105 110

Ile Arg Ser Thr Val Asp Glu Lys Glu Gln Lys Gln Leu Leu Met Asp

115 120 125

Leu Asp Val Val Met Arg Ser Ser Asp Cys Pro Tyr Ile Val Gln Phe

130 135 140

Tyr Gly Ala Leu Phe Arg Glu Gly Asp Cys Trp Ile Cys Met Glu Leu

145 150 155 160

Met Ser Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu

165 170 175

Asp Asp Val Ile Pro Glu Glu Ile Leu Gly Lys Ile Thr Leu Ala Thr

180 185 190

Val Lys Ala Leu Asn His Leu Lys Glu Asn Leu Lys Ile Ile His Arg

195 200 205

Asp Ile Lys Pro Ser Asn Ile Leu Leu Asp Arg Ser Gly Asn Ile Lys

210 215 220

Leu Cys Asp Phe Gly Ile Ser Gly Gln Leu Val Asp Ser Ile Ala Lys

225 230 235 240

Thr Arg Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro Glu Arg Ile Asp

245 250 255

Pro Ser Ala Ser Arg Gln Gly Tyr Asp Val Arg Ser Asp Val Trp Ser

260 265 270

Leu Gly Ile Thr Leu Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro

275 280 285

Lys Trp Asn Ser Val Phe Asp Gln Leu Thr Gln Val Val Lys Gly Asp

290 295 300

Pro Pro Gln Leu Ser Asn Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe

305 310 315 320

Ile Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro

325 330 335

Lys Tyr Lys Glu Leu Leu Lys His Pro Phe Ile Leu Met Tyr Glu Glu

340 345 350

Arg Ala Val Glu Val Ala Cys Tyr Val Cys Lys Ile Leu Asp Gln Met

355 360 363

Pro Ala Thr Pro Ser Ser Pro Met Tyr Val Asp



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 3554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAACAATGGC GGCTCCGAGC CCGAGCGGTG GCGGCGGCAG CGGCACCCCC GGCCCCGTAG 60 
GGTCCCCGGC GCCAGGCCAC CCGGCCGTCA GCAGCATGCA GGGTAAACGC AAAGCACTGA 120 
AGTTGAATTT TGCAAATCCA CCTTTCAAAT CTACAGCAAG GTTTACTCTG AATCCCAATC 180 
CTACAGGAGT TCAAAACCCA CACATAGAGA GACTGAGAAC ACACAGCATT GAGTCATCAG 240 
GAAAACTGAA GATCTCCCCT GAACAACACT GGGATTTCAC TGCAGAGGAC TTGAAAGACC 300
TTGGAGAAAT TGGACGAGGA GCTTATGGTT CTGTCAACAA AATGGTCCAC AAACCAAGTG 360 
GGCAAATAAT GGCAGTTAAA AGAATTCGGT CAACAGTGGA TGAAAAAGAA CAAAAACAAC 420 
TTCTTATGGA TTTGGATGTA GTAATGCGGA GTAGTGATTG CCCATACATT GTTCAGTTTT 480 
ATGGTGCACT CTTCAGAGAG GGTGACTGTT GGATCTGTAT GGAACTCATG TCTACCTCGT 540 
TTGATAAGTT TTACAAATAT GTATATAGTG TATTAGATGA TGTTATTCCA GAAGAAATTT 600 
TAGGCAAAAT CACTTTAGCA ACTGTGAAAG CACTAAACCA CTTAAAAGAA AACTTGAAAA 660
TTATTCACAG AGATATCAAA CCTTCCAATA TTCTTCTGGA CAGAAGTGGA AATATTAAGC 720
TCTGTGACTT CGGCATCAGT GGACAGCTTG TGGACTCTAT TGCCAAGACA AGAGATGCTG 780
GCTGTAGGCC ATACATGGCA CCTGAAAGAA TAGACCCAAG CGCATCACGA CAAGGATATG 840
ATGTCCGCTC TGATGTCTGG AGTTTGGGGA TCACATTGTA TGAGTTGGCC ACAGGCCGAT 900
TTCCTTATCC AAAGTGGAAT AGTGTATTTG ATCAACTAAC ACAAGTCGTG AAAGGAGATC 960
CTCCGCAGCT GAGTAATTCT GAGGAAAGGG AATTCTCCCC GAGTTTCATC AACTTTGTCA 1020
ACTTGTGCCT TACGAAGGAT GAATCCAAAA GGCCAAAGTA TAAAGAGCTT CTGAAACATC 1080
CCTTTATTTT GATGTATGAA GAACGTGCCG TTGAGGTCGC ATGCTATGTT TGTAAAATCC 1140
TGGATCAAAT GCCAGCTACT CCCAGCTCTC CCATGTATGT CGATTGATAT CGYTGCTACA 1200
TCAGACTCTA GAAAAAAGGG CTGAGAGGAA GCAAGACGTA AAGAATTTTC ATCCCGTATC 1260
ACAGTGTTTT TATTGCTCGC CCAGACACCA TGTGCAATAA GATTGGTGTT CGTTTCCATC 1320
ATGTCTGTAT ACTCCTGTCA CCTAGAACGT GCATCCTTGT AATACCTGAT TGATCACACA 1380
GTGTTAGTGC TGGTCAGAGA GACCTCATCC TGCTCTTTTG TGATGAACAT ATTCATGAAA 1440
TGTGGAAGTC AGTACGATCA AGTTGTTGAC TGTGATTAGA TCACATCTTA AATTCATTTC 1500
TAGACTCAAA ACCTGGAGAT GCAGCTACTG GAATGGTGTT TTGTCAGACT TCCAAATCCT 1560
GGAAGGACAC AGTGATGAAT GTACTATATC TGAACATAGA AACTCGGGCT TGAGTGAGAA 1620
GAGCTTGCAC AGCCAACGAG ACACATTGCC TTCTGGAGCT GGGAGACAAA GGAGGAATTT 1680 
ACTTTCTTCA CCAAGTGCAA TAGATTACTG ATGTGATATT CTGTTGCTTT ACAGTTACAG 1740 
TTGATGTTTG GGGATCGATG TGCTCAGCCA AATTTCCTGT TTGAAATATC ATGTTAAATT 1800 
AGAATGAATT TATCTTTACC AAAAACCATG TTGCGTTCAA AGAGGTGAAC ATTAAAATAT 1860 
AGAGACAGGA CAGAATGTGT TCTTTTCTCC TCTACCAGTC CTATTTTTCA ATGGGAAGAC 1920 
TCAGGAGTCT GCCACTTGTC AAAGAAGGTG CTGATCCTAA GAATTTTTCA TTCTCAGAAT 1980 
TCGGTGTGCT GCCAACTTGA TGTTCCACCT GCCACAAACC ACCAGGACTG AAAGAAGAAA 2040 
ACAGTACAGA AGGCAAAGTT TACAGATGTT TTTAATTCTA GTATTTTATC TGGAACAACT 2100 
TGTAGCAGCT ATATATTTCC CCTTGGTCCC AAGCCTGATA CTTTAGCCAT CATAACTCAC 2160 
TAACAGGGAG AAGTAGCTAG TAGCAATGTG CCTTGATTGA TTAGATAAAG ATTTCTAGTA 2220
GGCAGCAAAA GACCAAATCT CAGTTGTTTG CTTCTTGCCA TCACTGGTCC AGGTCTTCAG 2280 
TTTCCGAATC TCTTTCCCTT CCCCTGTGGT CTATTGTCGC TATGTGACTT GCGCTTAATC 2340 
CAATATTTTG CCTTTTTTCT ATATCAAAAA ACCTTTACAG TTAGCAGGGA TGTTCCTTAC 2400 
CGAGGATTTT TAACCCCCAA TCTCTCATAA TCGCTAGTGT TTAAAAGGCT AAGAATAGTG 2460 
GGGCCCAACC GATGTGGTAG GTGATAAAGA GGCATCTTTT CTAGAGACAC ATTGGACCAG 2520 
ATGAGGATCC GAAACGGCAG CCTTTACGTT CATCACCTGC TAGAACCTCT CGTAGTCCAT 2580
CACCATTTCT TGGCATTGGA ATTCTACTGG AAAAAAATAC AAAAAGCAAA ACAAAACCCT 2640
CAGCACTGTT ACAAGAGGCC ATTTAAGTAT CTTGTGCTTC TTCACTTACC CATTAGCCAG 2700 
GTTCTCATTA GGTTTTGCTT GGGCCTCCCT GGCACTGAAC CTTAGGCTTT GTATGACAGT 2760 
GAAGCAGCAC TGTGAGTGGT TCAAGCACAC TGGAATATAA AACAGTCATG GCCTGAGATG 2820 
CAGGTGATGC CATTACAGAA CCAAATCGTG GCACGTATTG CTGTGTCTCC TCTCAGAGTG 2880 
ACAGTCATAA ATACTGTCAA ACAATAAAGG GAGAATGGTG CTGTTTAAAG TCACATCCCT 2940 
GTAAATTGCA GAATTCAAAA GTGATTATCT CTTTGATCTA CTTGCCTCAT TTCCCTATCT 3000
TCTCCCCCAC GGTATCCTAA ACTTTAGACT TCCCACTGTT CTGAAAGGAG ACATTGCTCT 3060 
ATGTCTGCCT TCGACCACAG CAAGCCATCA TCCTCCATTG CTCCCGGGGA CTCAAGAGGA 3120 
ATCTGTTTCT CTGCTGTCAA CTTCCCATCT GGCTCAGCAT AGGGTCACTT TGCCATTATG 3180 
CAAATGGAGA TAAAAGCAAT TCTGGCTGTC CAGGAGCTAA TCTGACCGTT CTATTGTGTG 3240 
GATGACCACA TAAGAAGGCA ATTTTAGTGT ATTAATCATA GATTATTATA AACTATAAAC 3300 
TTAAGGGCAA GGAGTTTATT ACAATGTATC TTTATTAAAA CAAAAGGGTG TATAGTGTTC 3360 
ACAAACTGTG AAAATAGTGT AAGAACTGTA CATTGTGAGC TCTGGTTATT TTTCTCTTGT 3420 
ACCATAGAAA AATGTATAAA AATTATCAAA AAGCTAATGT GCAGGGATAT TGCCTTATTT 3480 
GTCTGTAAAA AATGGAGCTC AGTAACATAA CTGCTTCTTG GAGCTTTGGA ATATTTTATC 3540 
CTGTATTCTT GTTT 3554 

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 393 amino acids

(B) TYPE: peptide

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

5 10 15

Met Ala Ala Pro Ser Pro Ser Gly Gly Gly Gly Ser Gly Thr Pro Gly

20 25 30

Pro Val Gly Ser Pro Ala Pro Gly His Pro Ala Val Ser Ser Met Gln

35 40 45

Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe Ala Asn Pro Pro Phe Lys

50 55 60

Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val Gln Asn

65 70 75 80

Pro His Ile Glu Arg Leu Arg Thr His Ser Ile Glu Ser Ser Gly Lys

85 90 95

Leu Lys Ile Ser Pro Glu Gln His Trp Asp Phe Thr Ala Glu Asp Leu

100 105 110

Lys Asp Leu Gly Glu Ile Gly Arg Gly Ala Tyr Gly Ser Val Asn Lys

115 120 125

Met Val His Lys Pro Ser Gly Gln Ile Met Ala Val Lys Arg Ile Arg

130 135 140

Ser Thr Val Asp Glu Lys Glu Gln Lys Gln Leu Leu Met Asp Leu Asp

145 150 155 160

Val Val Met Arg Ser Ser Asp Cys Pro Tyr Ile Val Gln Phe Tyr Gly

165 170 175

Ala Leu Phe Arg Glu Gly Asp Cys Trp Ile Cys Met Glu Leu Met Ser

180 185 190

Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu Asp Asp

195 200 205

Val Ile Pro Glu Glu Ile Leu Gly Lys Ile Thr Leu Ala Thr Val Lys

210 215 220

Ala Leu Asn His Leu Lys Glu Asn Leu Lys Ile Ile His Arg Asp Ile

225 230 235 240

Lys Pro Ser Asn Ile Leu Leu Asp Arg Ser Gly Asn Ile Lys Leu Cys

245 250 255

Asp Phe Gly Ile Ser Gly Gln Leu Val Asp Ser Ile Ala Lys Thr Arg

260 265 270

Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro Glu Arg Ile Asp Pro Ser

275 280 285

Ala Ser Arg Gln Gly Tyr Asp Val Arg Ser Asp Val Trp Ser Leu Gly

290 295 300

Ile Thr Leu Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro Lys Trp

305 310 315 320

Asn Ser Val Phe Asp Gln Leu Thr Gln Val Val Lys Gly Asp Pro Pro

325 330 335

Gln Leu Ser Asn Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe Ile Asn

340 345 350

Phe Val Asn Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro Lys Tyr

355 360 365

Lys Glu Leu Leu Lys His Pro Phe Ile Leu Met Tyr Glu Glu Arg Ala

370 375 380

Val Glu Val Ala Cys Tyr Val Cys Lys Ile Leu Asp Gln Met Pro Ala

385 390 393

Thr Pro Ser Ser Pro Met Tyr Val Asp

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3636 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
CTCCCAACAA TGGCGGCTCC GAGCCCGAGC GGCGGCGGCG GCTCCGGGGG CGGCAGCGGC 60 
AGCGGCACCC CCGGCCCCGT AGGGTCCCCG GCGCCAGGCC ACCCGGCCGT CAGCAGCATG 120 
CAGGGTAAAC GCAAAGCACT GAAGTTGAAT TTTGCAAATC CACCTTTCAA ATCTACAGCA 180 
AGGTTTACTC TGAATCCCAA TCCTACAGGA GTTCAAAACC CACACATAGA GAGACTGAGA 240 
ACACACAGCA TTGAGTCATC AGGAAAACTG AAGATCTCCC CTGAACAACA CTGGGATTTC 300 
ACTGCAGAGG ACTTGAAAGA CCTTGGAGAA ATTGGACGAG GAGCTTATGG TTCTGTCAAC 360 
AAAATGGTCC ACAAACCAAG TGGGCAAATA ATGGCAGTTA AAAGAATTCG GTCAACAGTG 420 
GATGAAAAAG AACAAAAACA ACTTCTTATG GATTTGGATG TAGTAATGCG GAGTAGTGAT 480 
TGCCCATACA TTGTTCAGTT TTATGGTGCA CTCTTCAGAG AGGGTGACTG TTGGATCTGT 540 
ATGGAACTCA TGTCTACCTC GTTTGATAAG TTTTACAAAT ATGTATATAG TGTATTAGAT 600 
GATGTTATTC CAGAAGAAAT TTTAGGCAAA ATCACTTTAG CAACTGTGAA AGCACTAAAC 660 
CACTTAAAAG AAAACTTGAA AATTATTCAC AGAGATATCA AACCTTCCAA TATTCTTCTG 720 
GACAGAAGTG GAAATATTAA GCTCTGTGAC TTCGGCATCA GTGGACAGCT TGTGGACTCT 780 
ATTGCCAAGA CAAGAGATGC TGGCTGTAGG CCATACATGG CACCTGAAAG AATAGACCCA 840 
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AGCGCATCAC GACAAGGATA TGATGTCCGC 
TATGAGTTGG CCACAGGCCG AT-TTCCTTAT 
ACACAAGTCG TGAAAGGAGA TCCTCCGCAG 
CCGAGTTTCA TCAACTTTGT CAACTTGTGC 
TATAAAGAGC TTCTGAAACA TCCCTTTATT 
GCATGCTATG TTTGTAAAAT CCTGGATCAA 
GTCGATTGAT ATCGCTGCTA CATCAGACTC 
TAAAGAATTT TCATCCCG7A TCACAGTGTT 
AAGATTGGTG TTCGTTTCCA TCATGTCTGT 
GTAATACCTG ATTGATCACA CAGTGTTAGT 
TGTGATGAAC ATATTCATGA AATGTGGAAG 
GATCACATCT TAAATTCATT TCTAGACTCA 
TTTTGTCAGA CTTCCAAATC CTGGAAGGAC 
GAAACTCGGG CTTGAGTGAG AAGAGCTTGC 
CTGGGAGACA AAGGAGGAAT TTACTTTCTT 
TTCTGTTGCT TTACAGTTAC AGTTGATGTT 
GTTTGAAATA TCATGTTAAA TTAGAATGAA 
C&AAGAGGTGA ACATTAAAAT ATAGAGACAG 
-if CCTATTTTT CAATGGGAAG ACTCAGGAGT 
yjAGAATTTTT CATTCTCAGA ATTCGGTGTG 
f S C AC CAGGAC TGAAAGAAGA AAACAGTACA 
iTAGTATTTTA TCTGGAACAA CTTGTAGCAG 
Sactttagcc ATCATAACTC ACTAACAGGG 
? jfATT AGATAA AGATTTCTAG TAGGCAGCAA 
*€ATCACTGGT CCAGGTCTTC AGTTTCCGAA 
'GCTATGTGAC TTGCGCTTAA TCCAATATTT 
^GTTAGCAGG GATGTTCCTT ACCGAGGATT 
'GTTTAAAAGG CTAAGAATAG TGGGGCCCAA 
TTCTAGAGAC ACATTGGACC AGATGAGGAT 
GCTAGAACCT CTCGTAGTCC ATCACCATTT 
ACAAAAAGCA AAACAAAACC CTCAGCACTG 
iCTTCACTTA CCCATTAGCC AGGTTCTCAT 
&CCTTAGGCT TTGTATGACA GTGAAGCAGC 
AAAACAGTCA TGGCCTGAGA TGCAGGTGAT 
TGCTGTGTCT CCTCTCAGAG TGACAGTCAT 
TGCTGTTTAA AGTCACATCC CTGTAAATTG 
TACTTGCCTC ATTTCCCTAT CTTCTCCCCC 
TTCTGAAAGG AGACATTGCT CTATGTCTGC 
TGCTCCCGGG GACTCAAGAG GAATCTGTTT 
ATAGGGTCAC TTTGCCATTA TGCAAATGGA 
AATCTGACCG TTCTATTGTG TGGATGACCA 
TAGATTATTA TAAACTATAA ACTTAAGGGC 
AACAAAAGGG TGTATAGTGT TCACAAACTG 
GCTCTGGTTA TTTTTCTCTT GTACCATAGA 
GTGCAGGGAT ATTGCCTTAT TTGTCTGTAA 
TGGAGCTTTG GAATATTTTA TCCTGTATTC 



TCTGATGTCT GGAGTTTGGG GATCACATTG 900 
CCAAAGTGGA ATAGTGTATT TGATCAACTA 960 
CTGAGTAATT CTGAGGAAAG GGAATTCTCC 1020 
CTTACGAAGG ATGAATCCAA AAGGCCAAAG 1080 
TTGATGTATG AAGAACGTGC CGTTGAGGTC 1140 
ATGCCAGCTA CTCCCAGCTC TCCCATGTAT 1200 
TAGAAAAAAG GGCTGAGAGG AAGCAAGACG 1260 
TTTATTGCTC GCCCAGACAC CATGTGCAAT 1320 
ATACTCCTGT CACCTAGAAC GTGCATCCTT 1380 
GCTGGTCAGA GAGACCTCAT CCTGCTCTTT 1440 
TCAGTACGAT CAAGTTGTTG ACTGTGATTA 1500 
AAACCTGGAG ATGCAGCTAC TGGAATGGTG 1560 
ACAGTGATGA ATGTACTATA TCTGAACATA 1620 
ACAGCCAACG AGACACATTG CCTTCTGGAG 1680 
CACCAAGTGC AATAGATTAC TGATGTGATA 1740 
TGGGGATCGA TGTGCTCAGC CAAATTTCCT 1800 
TTTATCTTTA CCAAAAACCA TGTTGCGTTC 1860 
GACAGAATGT GTTCTTTTCT CCTCTACCAG 1920 
CTGCCACTTG TCAAAGAAGG TGCTGATCCT 1980 
CTGCCAACTT GATGTTCCAC CTGCCACAAA 2040 
GAAGGCAAAG TTTACAGATG TTTTTAATTC 2100 
CTATATATTT CCCCTTGGTC CCAAGCCTGA 2160 
AGAAGTAGCT AGTAGCAATG TGCCTTGATT 2220 
AAGACGAAAT CTCAGTTGTT TGCTTCTTGC 2340 
TCTCTTTCCC TTCCCCTGTG GTCTATTGTC 2400 
TGCCTTTTTT CTATATCAAA AAACCTTTAC 2460 
TTTAACCCCC AATCTCTCAT AATCGCTAGT 2520 
CCGATGTGGT AGGTGATAAA GAGGCATCTT 2580 
CCGAAACGGC AGCCTTTACG TTCATCACCT 2640 
CTTGGCATTG GAATTCTACT GGAAAAAAAT 2700 
TTACAAGAGG CCATTTAAGT ATCTTGTGCT 2760 
TAGGTTTTGC TTGGGCCTCC CTGGCACTGA 2820 
ACTGTGAGTG GTTCAAGCAC ACTGGAATAT 2880 
GCCATTACAG AACCAAATCG TGGCACGTAT 2940 
AAATACTGTC AAACAATAAA GGGAGAATGG 3000 
CAGAATTCAA AAGTGATTAT CTCTTTGATC 3 060 
ACGGTATCCT AAACTTTAGA CTTCCCACTG 3120 
CTTCGACCAC AGCAAGCCAT CATCCTCCAT 3180 
CTCTGCTGTC AACTTCCCAT CTGGCTCAGC 3240 
GATAAAAGCA ATTCTGGCTG TCCAGGAGCT 3300 
CATAAGAAGG CAATTTTAGT GTATTAATCA 3360 
AAGGAGTTTA TTACAATGTA TCTTTATTAA 3420 
TGAAAATAGT GTAAGAACTG TACATTGTGA 3480 
AAAATGTATA AAAATTATCA AAAAGCTAAT 3540 
AAAATGGAGC TCAGTAACAT AACTGCTTCT 3600 
TTGTTT 3636 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 399 amino acids 

(B) TYPE: peptide 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



5 10 15 



ric t, 


nld 


Al a 
AX d 




Oar 
OCX 




OCX 


Glv 


Glv Glv 
uxy oxy 


Gl v 
ox y 


Car 

OCX 


Vjrx y vjly uly ocx 








20 










25 






^ 0 


\jm y 


Oa r 

O C X 




Thr- 


-t X w 


Glv 


prn 

.t X W 


Val 


Glv Ser 

wX jf wCX 


* X w 


Al a 
.nx a 


Prn Gl v Ui c Drn 
rx u vjxy nxo irX\J 






35 










40 








45 


Al a 


v a x 


Car 


OCX 


Met 


Gin 


Glv 


Lys 


£^X ^ XJ J « 


Ala 

nx A 


TiPi i 

XJC IX 


TiVQ TiOii Aen DHo 
i_iyo xjcul Aoii rue 




50 










55 








60 




Al a 


noil 




JT X Vj 


Phe 


T ,VC! 
xjy o 


OCX 


Thr 

X 11X 


Al a Atct 

nx a ax y 


Phe 

XT 11C 


Thr 

X 11X 


T.pn Aen Pm Acn 
xtc li ao ii riu AO 11 


O -J 










70 








75 




80 




Th r 
1 11X 


vjjx y 


Val 

V CI. X 


Gin 

will 


Asn 


Pro 

X X W 


His 

1 XX i3 


He Glu 

XXe wx ux 


x^x y 


XJC \JL 


A yrr Thr* Ui q Ser 
ax y x xix xxxo ocx 








85 








90 






95 


lie 


vjrX U 


Coy* 

OCX 


Cav 
OCX 


ui y 


xjy o 


T.A11 
LlCU 


xjy o 


Tip qar 

11C OCX 


Pm 

C X \J 


Gl n 


Gl n Ui e Trn Ian 
vjlll xllo lip nop 








X W V 










105 






110 

X X \J 


?3 Ih a 


Tn v 

i nr 


riJ»a 


gi ii 




XiC LA 


xjy o 


Zicn 

AO p 


xjc ui vjxy 


Gl n 

wX UX 


Tie 

X xc 


Gl v Arrr Gl v Ala 
vjx y ax y vjxy ax a 






1 1 C 
X X 3 










120 

X <b W 








125 

X Ai O 


f 


oiy 


OCX 


Val 


AO 11 


xjy o 


Mob 


Val 
v ax 


til e T.VQ 

nxo xiy o 


Pm 

X X w 


Car 
OCX 


Glv Gin Tip Mor 
\axy V7Xxi xxc i T i c u 














135 

X J 








140 

X w 




■7\1 ^ 

£*ia 


vai 


Lys 


ATy 


Tl o 
11c 


Arg 


Car 
OCX 


Th t* 
1 xlX 


Va 1 A en 
Val nop 


Gl ii 


ljy S 


n /Til n T t/c nl n 

uiU vjin ijys uin 


3X §± 3 










150 

X J w 








155 

X <J 




160 

X W w 


;£j6U 


lacU 


Mot" 


A on 
nop 


T.oi i 

XJCU. 


A en 
AO Ly 


Val 

V CLX 


Val 

v ax 


Ma f- A r<T 
i IC L« ax y 


Car 
OCX 


Car 

OCX 


A en fvQ Pm Tvr 
AOLy \-»y o rxu xyx 










165 








170 






175 


T "1 a 


Val 
Val 


nl n 
vjtXII 


Phe 


iyr 


vjtx y 


Ala 

AX a 




Prie A yrr 
XT nc ax y 


Gl u 

V7X \Jk 


Glv 
wX y 


A en fve Ttti Tie 
nop v^yo up xxc 








180 










185 






190 






nl n 

wX LL 


XJCwi 


Met 

* IC U> 


Car 

O CX 


Th*r 

X XXX 


Ser 


Phe Asn 


TiV^ 

xj y o 


Phe 

x xxe 


Tvr T<vq Tvr V^al 

xyx uy 0 xyx v ax 






195 










200 








205 


:t T*-t r"y« 
ff^ r 




V al 


T .oi i 

XiCU. 


A en 

AO£> 


AO Ly 


Val 

v ax 


lie 

X xc 


Pr*n Gl ii 
nu \jx Lx 


Gl u 
oiu 


Tie 

X xc 


T ,oi i mv T.vg Tie 
ucu vjx y xtyo xxc 




X \J 










215 








220 






Leu 


A 1 -a 

Ala 


1 ill 


Val 

V Ctl 


xjy o 


Al a 


T iOi i 

XJC U. 


AO 11 xlxO 


T.oii 


xjy o 


nl ii Aen T.oii Ta/q 
VjlU noil xjcU liyo 


^ ^ j 










230 








235 




240 


T 1 a 


lie 


tin a 

rilS 




Aop 


Tl e 
lie 


xay o 


xx *J 


Car A en 

OCX AO XX 


Tl o 
lie 


T ,oi i 

XJCU 


T.oii Aen A y<r Qar 
xjcU nop AXy OCX 










245 250 255 
Gly Asn Ile Lys Leu Cys Asp Phe Gly Ile Ser Gly Gln Leu Val Asp 

260 265 270 
Ser Ile Ala Lys Thr Arg Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro 

275 280 285 
Glu Arg Ile Asp Pro Ser Ala Ser Arg Gln Gly Tyr Asp Val Arg Ser 

290 295 300 
Asp Val Trp Ser Leu Gly Ile Thr Leu Tyr Glu Leu Ala Thr Gly Arg 

305 310 315 320 
Phe Pro Tyr Pro Lys Trp Asn Ser Val Phe Asp Gln Leu Thr Gln Val 

325 330 335 
Val Lys Gly Asp Pro Pro Gln Leu Ser Asn Ser Glu Glu Arg Glu Phe 

340 345 350 
Ser Pro Ser Phe Ile Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu 

355 360 365 
Ser Lys Arg Pro Lys Tyr Lys Glu Leu Leu Lys His Pro Phe Ile Leu 

370 375 380 
Met Tyr Glu Glu Arg Ala Val Glu Val Ala Cys Tyr Val Cys Lys Ile 

385 390 395 399 
Leu Asp Gln Met Pro Ala Thr Pro Ser Ser Pro Met Tyr Val Asp 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids 

(B) TYPE: peptide 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

5 10 15 
Met Pro Lys Lys Lys Pro Thr Pro Ile Gln Leu Asn Pro Ala Pro Asp 

20 25 30 
Gly Ser Ala Val Asn Gly Thr Ser Ser Ala Glu Thr Asn Leu Glu Ala 

35 40 45 
Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu Gln Gln Arg Lys 

50 55 60 
Leu Glu Ala Phe Leu Thr Gln Lys Gln Lys Val Gly Glu Leu Lys 

65 70 75 80 
Asp Asp Asp Phe Glu Lys Ile Ser Glu Leu Gly Ala Gly Asn Gly Gly 

85 90 95 
Val Val Phe Lys Val Ser His Lys Pro Ser Gly Leu Val Met Ala Arg 

100 105 110 
Lys Leu Ile His Leu Glu Ile Lys Pro Ala Ile Arg Asn Gln Ile Ile 

115 120 125 
Arg Glu Leu Gln Val Leu His Glu Cys Asn Ser Pro Tyr Ile Val Gly 

130 135 140 
Phe Tyr Gly Ala Phe Tyr Ser Asp Gly Glu Ile Ser Ile Cys Met Glu 

145 150 155 160 
His Met Asp Gly Gly Ser Leu Asp Gln Val Leu Lys Lys Ala Gly Arg 

165 170 175 
Ile Pro Glu Gln Ile Leu Gly Lys Val Ser Ile Ala Val Ile Lys Gly 

180 185 190 
Leu Thr Tyr Leu Arg Glu Lys His Lys Ile Met His Arg Asp Val Lys 

195 200 205 
Pro Ser Asn Ile Leu Val Asn Ser Arg Gly Glu Ile Lys Leu Cys Asp 

210 215 220 
Phe Gly Val Ser Gly Gln Leu Ile Asp Ser Met Ala Asn Ser Phe Val 

225 230 235 240 
Gly Thr Arg Ser Tyr Met Ser Pro Glu Arg Leu Gln Gly Thr His Tyr 

245 250 255 
Ser Val Gln Ser Asp Ile Trp Ser Met Gly Leu Ser Leu Val Glu Met 

260 265 270 
Ala Val Gly Arg Tyr Pro Ile Pro Pro Pro Asp Ala Lys Glu Leu Glu 

275 280 285 
Leu Met Phe Gly Cys Gln Val Glu Gly Asp Ala Ala Glu Thr Pro Pro 

290 295 300 
Arg Pro Arg Thr Pro Gly Arg Pro Leu Ser Ser Tyr Gly Met Asp Ser 

305 310 315 320 
Arg Pro Pro Met Ala Ile Phe Glu Leu Leu Asp Tyr Ile Val Asn Glu 

325 330 335 
Pro Lys Leu Pro Ser Gly Val Phe Ser Leu Glu Phe Gln Asp 

340 345 350 
Phe Val Asn Lys Cys Leu Ile Lys Asn Pro Ala Glu Arg Ala Asp Leu 

355 360 365 
Lys Gln Leu Met Val His Ala Phe Ile Lys Arg Ser Asp Ala Glu Glu 

370 375 380 
Val Asp Phe Ala Gly Trp Leu Cys Ser Thr Ile Gly Leu Asn Gln Pro 

385 390 393 
Ser Thr Pro Thr His Ala Ala Gly Val 














(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: peptide 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 











5 10 15 
Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr Ile Asn Pro 

20 25 30 
Thr Ile Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu Ala 

35 40 45 
Asn Leu Val Asp Leu Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 

50 55 60 
Gln Gln Lys Lys Arg Leu Glu Ala Phe Leu Thr Gln Lys Ala Lys Val 

65 70 75 80 
Ser Glu Leu Lys Asp Asp Asp Phe Glu Arg Ile Ser Glu Leu Gly Ala 

85 90 95 
Gly Asn Gly Gly Val Val Thr Lys Val Gln His Arg Pro Ser Gly Leu 

100 105 110 
Ile Met Ala Arg Lys Leu Ile His Leu Glu Ile Lys Pro Ala Ile Arg 

115 120 125 
Asn Gln Ile Ile Arg Glu Leu Gln Val Leu His Glu Cys Asn Ser Pro 

130 135 140 
Tyr Ile Val Gly Phe Tyr Gly Ala Phe Tyr Ser Asp Gly Glu Ile Ser 

145 150 155 160 
Ile Cys Met Glu His Met Asp Gly Gly Ser Leu Asp Gln Val Leu Lys 

165 170 175 
Glu Ala Lys Arg Ile Pro Glu Glu Ile Leu Gly Lys Val Ser Ile Ala 

180 185 190 
Val Leu Arg Gly Leu Ala Tyr Leu Arg Glu Lys His Gln Ile Met His 

195 200 205 
Arg Asp Val Lys Pro Ser Asn Ile Leu Val Asn Ser Arg Gly Glu Ile 

210 215 220 
Lys Leu Cys Asp Phe Gly Val Ser Gly Gln Leu Ile Asp Ser Met Ala 

225 230 235 240 
Asn Ser Phe Val Gly Thr Arg Ser Tyr Met Ala Pro Glu Arg Leu Gln 

245 250 255 
Gly Thr His Tyr Ser Val Gln Ser Asp Ile Trp Ser Met Gly Leu Ser 

260 265 270 
Leu Val Glu Leu Ala Val Gly Arg Tyr Pro Ile Pro Pro Pro Asp Ala 

275 280 285 
Lys Glu Leu Glu Ala Ile Phe Gly Arg Pro Val Val Asp Gly Glu Glu 

290 295 300 
Gly Glu Pro His Ser Ile Ser Pro Arg Pro Arg Pro Pro Gly Arg Pro 

305 310 315 320 
Val Ser Gly His Gly Met Asp Ser Arg Pro Ala Met Ala Ile Phe Glu 

325 330 335 
Leu Leu Asp Tyr Ile Val Asn Glu Pro Pro Pro Lys Leu Pro Asn Gly 

340 345 350 
Val Phe Thr Pro Asp Phe Gln Glu Phe Val Asn Lys Cys Leu Ile Lys 

355 360 365 
Asn Pro Ala Glu Arg Ala Asp Leu Lys Met Leu Thr Asn His Thr Phe 

370 375 380 
Ile Lys Arg Ser Glu Val Glu Glu Val Asp Phe Ala Gly Trp Leu Cys 

385 390 395 400 
Lys Thr Leu Arg Leu Asn Gln Pro Gly Thr Pro Thr Arg Thr Ala Val 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 amino acids 

(B) TYPE: peptide 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 











5 10 15 
Met Glu Asp Lys Phe Ala Asn Leu Ser Leu His Glu Lys Thr Gly Lys 

20 25 30 
Ser Ser Ile Gln Leu Asn Glu Gln Thr Gly Ser Asp Asn Gly Ser Ala 

35 40 45 
Val Lys Arg Thr Ser Ser Thr Ser Ser His Tyr Asn Asn Ile Asn Ala 

50 55 60 
Asp Leu His Ala Arg Val Lys Ala Phe Gln Glu Gln Arg Ala Leu Lys 

65 70 75 80 
Arg Ser Ala Ser Val Gly Ser Asn Gln Ser Glu Gln Asp Lys Gly Ser 

85 90 95 
Ser Gln Ser Pro Lys His Ile Gln Gln Ile Val Asn Lys Pro Leu Pro 

100 105 110 
Pro Leu Pro Val Ala Gly Ser Ser Lys Val Ser Gln Arg Met Ser Ser 

115 120 125 
Gln Val Val Gln Ala Ser Ser Lys Ser Thr Leu Lys Asn Val Leu Asp 

130 135 140 
Asn Gln Glu Thr Gln Asn Ile Thr Asp Val Asn Ile Asn Ile Asp Thr 

145 150 155 160 
Thr Lys Ile Thr Ala Thr Thr Ile Gly Val Asn Ile Gly Leu Pro Ala 

165 170 175 
Thr Asp Ile Thr Pro Ser Val Ser Asn Thr Ala Ser Ala Thr His Lys 

180 185 190 
Ala Gln Leu Leu Asn Pro Asn Arg Arg Ala Pro Arg Arg Pro Leu Ser 

195 200 205 
Thr Gln His Pro Thr Arg Pro Asn Val Ala Pro His Lys Ala Pro Ala 

210 215 220 
Ile Ile Asn Thr Pro Lys Gln Ser Leu Ser Ala Arg Arg Gly Leu Lys 

225 230 235 240 
Leu Pro Pro Gly Gly Met Ser Leu Lys Met Pro Thr Lys Thr Ala Gln 

245 250 255 
Gln Pro Gln Gln Phe Ala Pro Ser Pro Ser Asn Lys Lys His Ile Glu 

260 265 270 
Thr Leu Ser Asn Ser Lys Val Val Glu Gly Lys Arg Ser Asn Pro Gly 

275 280 285 
Ser Leu Ile Asn Gly Val Gln Ser Thr Ser Thr Ser Ser Ser Thr Glu 

290 295 300 
Gly Pro His Asp Thr Val Gly Thr Thr Pro Arg Thr Gly Asn Ser Asn 

305 310 315 320 
Asn Ser Ser Asn Ser Gly Ser Ser Gly Gly Gly Gly Leu Phe Ala Asn 

325 330 335 
Phe Ser Lys Tyr Val Asp Ile Lys Ser Gly Ser Leu Asn Phe Ala Gly 

340 345 350 
Lys Leu Ser Leu Ser Ser Lys Gly Ile Asp Phe Ser Asn Gly Ser Ser 

355 360 365 
Ser Arg Ile Thr Leu Asp Glu Leu Glu Phe Leu Asp Glu Leu Gly His 

370 375 380 
Gly Asn Tyr Gly Asn Val Ser Lys Val Leu His Lys Pro Thr Asn Val 

385 390 395 400 
Ile Met Ala Thr Lys Glu Val Arg Leu Glu Leu Asp Glu Ala Lys Phe 

405 410 415 
Arg Gln Ile Leu Met Glu Leu Glu Val Leu His Lys Cys Asn Ser Pro 

420 425 430 
Tyr Ile Val Asp Phe Tyr Gly Ala Phe Phe Ile Glu Gly Ala Val Tyr 

435 440 445 
Met Cys Met Glu Tyr Met Asp Gly Gly Ser Leu Asp Lys Ile Tyr Asp 

450 455 460 
Glu Ser Ser Glu Ile Gly Gly Ile Asp Glu Pro Gln Leu Ala Phe Ile 

465 470 475 480 
Ala Asn Ala Val Ile His Gly Leu Lys Glu Leu Lys Glu Gln His Asn 

485 490 495 
Ile Ile His Arg Asp Val Lys Pro Thr Asn Ile Leu Cys Ser Ala Asn 

500 505 510 
Gln Gly Thr Val Lys Leu Cys Asp Phe Gly Val Ser Gly Asn Leu Val 

515 520 525 
Ala Ser Leu Ala Lys Thr Asn Ile Gly Cys Gln Ser Tyr Met Ala Pro 

530 535 540 
Glu Arg Ile Lys Ser Leu Asn Pro Asp Arg Ala Thr Tyr Thr Val Gln 

545 550 555 560 
Ser Asp Ile Trp Ser Leu Gly Leu Ser Ile Leu Glu Met Ala Leu Gly 

565 570 575 
Arg Tyr Pro Tyr Pro Pro Glu Thr Tyr Asp Asn Ile Phe Ser Gln Leu 

580 585 590 
Ser Ala Ile Val Asp Gly Pro Pro Pro Arg Leu Pro Ser Asp Lys Phe 

595 600 605 
Ser Ser Asp Ala Gln Asp Phe Val Ser Leu Cys Leu Gln Lys Ile Pro 

610 615 620 
Glu Arg Arg Pro Thr Tyr Ala Ala Leu Thr Glu His Pro Trp Leu Val 

625 630 635 640 
Tyr Arg Asn Gln Asp Val His Met Ser Glu Tyr Ile Thr Glu Arg 

645 650 655 
Leu Glu Arg Arg Asn Lys Ile Leu Arg Glu Arg Gly Glu Asn Gly Leu 

660 665 668 
Lys Asn Val Pro Ala Leu His Met Gly Gly Leu 









(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTYTAYGGNG CNTTYTTYAT HGA 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATBCTYTCNG GNGCCATKTA 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: peptide 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
5 8 

Asp Tyr Lys Asp Asp Asp Asp Lys 


